Based on an 87-fb$^{-1}$ dataset collected by the BABAR detector at the
PEP-II asymmetric-energy B-Factory, a search for $D^0$-$\bar{D}^0$ mixing
has been made using the semileptonic decay modes
$D^{*+} \to \pi^+ D^0$, $D^0 \to K^{(*)} e \nu$ (+c.c.). The use of these
modes allows unambiguous flavor tagging and a combined fit of the $D^0$
decay time and $D^{*+}$-$D^0$ mass difference ($\Delta M$) distributions.
The high-statistics sample of unmixed semileptonic $D^0$ decays is used to
model the $\Delta M$ distribution and time-dependence of mixed events
directly from the data. Neural networks are used to select events and
reconstruct the $D^0$. A result consistent with no charm mixing has been
obtained, $R_{\rm mix} = 0.0023 \pm 0.0012 \pm 0.0004$. This corresponds
to an upper limit of $R_{\rm mix} < 0.0042$ (90% CL).

PACS numbers: 13.25.Ft, 12.15.Ff, 11.30.Er



In the Standard Model (SM), $D^0$-$\bar{D}^0$ mixing typically proceeds
through box diagram loops involving down-type quarks. This results in very
effective GIM and CKM suppressions of the mixing rate. The expected rate,
relative to the unmixed rate, of $D^0$-$\bar{D}^0$ mixing through SM box
and di-penguin diagrams is $\mathcal{O}(10^{-8})$ to
$\mathcal{O}(10^{-10})$, many orders of magnitude below the current
experimental sensitivity of $\mathcal{O}(10^{-3})$. Possible enhancements
to the SM mixing rate involve non-perturbative effects, and predictions
range over many orders of magnitude, approximately bounded by the box
diagram rate and the experimental sensitivity. New physics charm mixing
predictions span the same large range and, therefore, the presence of a
mixing signal alone would not be a clear indication of new physics.
However, the current experimental bounds on charm mixing already constrain
new physics models.

Charm mixing is generally characterized by two dimensionless parameters,
$x = \Delta m/\Gamma$ and $y = \Delta\Gamma/2\Gamma$, where
$\Delta m = m_2 - m_1$ ($\Delta\Gamma = \Gamma_2 - \Gamma_1$) is the mass
(width) difference between the two neutral $D$ mass eigenstates and
$\Gamma$ is the average width. If either $x$ or $y$ is non-zero, then
$D^0$-$\bar{D}^0$ mixing will occur.

The time evolution of a neutral $D$ meson depends on the type of final
state into which it decays, since doubly Cabibbo-suppressed (DCS) decays
to the same final states as mixed events allow for interference between
the mixing and DCS amplitudes. Semileptonic decays, for which there are no
DCS amplitudes, have a particularly simple time-dependence,

$$T_{\rm mix}(t) \cong T_{\rm unmix}(t)\,\frac{x^2 + y^2}{4}\left(\frac{t}{\tau_{D^0}}\right)^2, \qquad (1)$$

where $t$ is the proper time of the decay,
$T_{\rm unmix}(t) \propto e^{-t/\tau_{D^0}}$, and the approximation is
valid in the limit of small mixing rates. Sensitivity to $x$ and $y$
individually is lost with semileptonic final states. The time-integrated
mixing rate $R_{\rm mix}$ relative to the unmixed rate is

$$R_{\rm mix} = \frac{x^2 + y^2}{2}. \qquad (2)$$
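As a numerical cross-check of how Eq. (2) follows from Eq. (1), the sketch below integrates the mixed and unmixed decay-time shapes and compares their ratio with $(x^2 + y^2)/2$. The function names, the integration range and the example values of $x$ and $y$ are illustrative assumptions and are not taken from the analysis.

```python
import math

def unmixed_rate(t, tau=1.0):
    """Unmixed decay-time shape, proportional to exp(-t/tau)."""
    return math.exp(-t / tau)

def mixed_rate(t, x, y, tau=1.0):
    """Mixed shape of Eq. (1): unmixed shape times (x^2+y^2)/4 * (t/tau)^2."""
    return unmixed_rate(t, tau) * (x * x + y * y) / 4.0 * (t / tau) ** 2

def integrate(f, lo, hi, steps=20_000):
    """Composite trapezoidal rule on [lo, hi]."""
    h = (hi - lo) / steps
    total = 0.5 * (f(lo) + f(hi))
    for k in range(1, steps):
        total += f(lo + k * h)
    return total * h

def r_mix_numeric(x, y, tau=1.0, t_max=60.0):
    """Ratio of time-integrated mixed to unmixed rates (cut off at t_max * tau)."""
    num = integrate(lambda t: mixed_rate(t, x, y, tau), 0.0, t_max * tau)
    den = integrate(lambda t: unmixed_rate(t, tau), 0.0, t_max * tau)
    return num / den

def r_mix_analytic(x, y):
    """Eq. (2): R_mix = (x^2 + y^2) / 2."""
    return (x * x + y * y) / 2.0
```

The factor of two between the 1/4 in Eq. (1) and the 1/2 in Eq. (2) comes from the integral of $(t/\tau)^2 e^{-t/\tau}$, which is twice the integral of $e^{-t/\tau}$.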

We present a measurement of $R_{\rm mix}$ using an 87-fb$^{-1}$ data
sample collected on and just below the $\Upsilon(4S)$ resonance with the
BABAR detector at the PEP-II asymmetric-energy $e^+e^-$ storage ring. A
full BABAR detector Monte Carlo simulation based on GEANT4 [2] is used to
study background sources and to develop methods to reconstruct neutral $D$
mesons decaying to semileptonic final states.

Charged tracks are measured in a five-layer silicon vertex tracker (SVT)
surrounded by a cylindrical wire drift chamber (DCH). Charged hadrons are
identified through $dE/dx$ measurements in the tracking volume and in a
ring-imaging Cherenkov detector (DIRC) surrounding the DCH. Identified
charged $K$ purities averaged across all momenta are about 99%, with ~80%
reconstruction efficiency. Relative charge asymmetries between $K^+$ and
$K^-$ in both misidentification rate and reconstruction efficiency are at
most a few percent across all momenta and contribute negligibly to the
total error on the mixing rate. Charge conjugation is implied unless
explicitly stated otherwise.

Electromagnetic showers from electrons and photons are detected by CsI
crystals arrayed in an electromagnetic calorimeter (EMC) located between
the DIRC and the superconducting solenoid. Electrons are differentiated
from other charged tracks by the intrinsic differences between hadronic,
muonic and electronic energy depositions in the EMC. As with charged $K$
candidates, there are relative charge asymmetries of at most a few percent
in both misidentification rate and reconstruction efficiency between
positrons and electrons, which have a negligible contribution to the total
error on the mixing rate. Electrons and positrons arising from gamma
conversions are eliminated from the pool of tracks used to reconstruct
$D^0$ candidates.

Neutral $D$ candidates are selected by reconstructing the decay chain
$D^{*+} \to \pi^+ D^0$, $D^0 \to K^{(*)} e \nu$. There are no essential
differences for this analysis between the $K$ and $K^*$ modes, either
theoretically or empirically, and thus no attempt is made to reconstruct
the $K^*$: its charged $K$ daughter is treated as if it were a direct
daughter of the $D^0$. The charge of the pion daughter of the charged
$D^*$ identifies the production flavor of the neutral $D$, while the
charge of the electron identifies the decay flavor. These charges are the
same for unmixed decays and different for mixed decays, denoted as
right-sign (RS) and wrong-sign (WS) decays, respectively.
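The charge-based tagging rule described above can be sketched as a small function. The function and argument names are hypothetical; only the rule itself (same charges for unmixed decays, different charges for mixed decays) comes from the text.

```python
def classify_candidate(tag_pion_charge, electron_charge):
    """Label a candidate as right-sign (RS) or wrong-sign (WS).

    The tagging-pion charge gives the production flavor and the electron
    charge gives the decay flavor; equal charges mean an unmixed decay.
    """
    for charge in (tag_pion_charge, electron_charge):
        if charge not in (+1, -1):
            raise ValueError("charges must be +1 or -1")
    return "RS" if tag_pion_charge == electron_charge else "WS"


def count_by_sign(candidates):
    """Count RS and WS labels for an iterable of (pion, electron) charge pairs."""
    counts = {"RS": 0, "WS": 0}
    for pion_q, electron_q in candidates:
        counts[classify_candidate(pion_q, electron_q)] += 1
    return counts
```

For example, a $\pi^+$ tag paired with an $e^+$ is labeled RS, while a $\pi^+$ tag paired with an $e^-$ is labeled WS.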

The measured mixing rate is parameterized as
$R_{\rm mix} = n_{\rm WS}/n_{\rm RS}$. We fit the number of RS (WS) signal
decays, $n_{\rm RS}$ ($n_{\rm WS}$), with a likelihood combining the
$D^{*+}$-$D^0$ mass difference ($\Delta M$) distribution with the unmixed
and mixed (Eq. (1)) decay time distributions, respectively. The
high-statistics sample of RS signal events is used to model the $\Delta M$
distribution and the time-dependence of mixed events directly from the
data. To avoid potential bias, we perform a blind analysis in which the
event selection criteria, and procedures for fitting the data and
extracting an upper limit (UL), are determined prior to examining the WS
signal region $\Delta M$ and decay time distributions in the data.

Identified $K$ and $e$ candidates of opposite charges are combined to
create neutral $D$ candidate decay vertices. Only candidates with vertex
fit probability > 0.01 and mass < 1.82 GeV/$c^2$ are retained. The value
of the mass cut is chosen to exclude all hadronic two-body decays. The
average PEP-II interaction point (IP), measured on a run-to-run basis
using Bhabha and $\mu^+\mu^-$ events, is taken as the production point of
$D^0$ candidates. The transverse decay time is used due to the relative
narrowness of, and small errors on, the IP in the transverse ($r$-$\phi$)
plane, as opposed to the IP's longitudinal ($z$) profile. Because of the
very small transverse distance traveled by $B$ mesons prior to decay,
there are no significant differences between the decays of charmed parents
arising from the continuum (prompt events) and those coming from the decay
of $B$ mesons, other than the harder $D^0$ momentum spectrum from prompt
events.

The pions from $D^{*+}$ decays are relatively soft tracks with
$p^* < 450$ MeV/$c$, where the asterisk denotes a parameter measured in
the $\Upsilon(4S)$ center-of-mass frame. Charged tracks identified as
either a charged $K$ or $e$ candidate are not considered as $\pi$
candidates. To reject poorly reconstructed tracks, $\pi$ candidates are
required to have six or more SVT hits, with > 2 $r$-$\phi$ layer hits,
> 2 $z$ layer hits, at least one hit on the inner three $r$-$\phi$ layers
and at least one hit on the inner three $z$ layers. Pion candidates are
also refit with the IP as a constraint and accepted only if the refit
probability is greater than 0.01.

The individual $K$, $e$ and $\pi$ tracks, the $K$-$e$ vertex, and the
event thrust are used to reconstruct the three orthogonal components of
the $D^0$ momentum vector with three JetNet 3.4 neural networks (NN's).
Each NN has two
hidden layers, and is trained and tested with a large sam- 
ple (0(10^)) of simulated signal events. The following 
vector inputs to the NN's are used: the momentum of the $K$-$e$ pair
constrained by a vertex fit [$p^*(K\text{-}e)$], $p^*(\pi)$ and the event
thrust vector $T$ (calculated using all charged and neutral candidates
except the $K$ and $e$ candidates). In addition, four opening angles among
the above parameters are used: $\theta^*(p_{K\text{-}e}, T)$,
$\theta^*(p_K, p_e)$, $\theta^*(p_\pi, T)$, and
$\theta^*(p_{K\text{-}e}, p_\pi)$. The vector inputs are separated into
their respective orthogonal components (azimuthal/polar angles and
magnitude) and, additionally using the four scalar inputs, separate NN's
for $p^*_\phi(D^0)$, $p^*_\theta(D^0)$, and $|p^*(D^0)|$ are trained with
the seven input parameters. The residuals between the true $p^*(D^0)$
direction for simulated signal events and the NN output are unbiased and
Gaussian-distributed with $\sigma \sim 110$ mrad. The distribution of
momentum magnitude residuals shows an error of $\sigma_p/p \approx 10$%
for a typical signal event.



The $D^{*+}$-$D^0$ mass difference is calculated using the $D^0$
candidate and the tagging pion candidate. The signal $\Delta M$
distribution peaks ~3 MeV/$c^2$ below the PDG value of 145.4 MeV/$c^2$
and has an RMS width of ~2.2 MeV/$c^2$. The shift in the peak introduces
no bias into the measurement of the mixing rate and is due to the use of
tagging pion information in reconstructing the $D^0$ momentum. A
~80 MeV/$c^2$ wide $\Delta M$ sideband is retained from 160 MeV/$c^2$ to
240 MeV/$c^2$ in order to characterize the level of background events.

The transverse momentum of a $D^0$ candidate, and the projections of the
IP and $K$-$e$ vertex loci on the $r$-$\phi$ plane, are used to calculate
the candidate's proper decay time. The error on the decay time, calculated
using only the errors on the IP and $K$-$e$ vertex, is typically
$0.6\,\tau_{D^0}$, where $\tau_{D^0}$ is the nominal mean $D^0$ lifetime.
The contribution of the $p^*(D^0)$ estimator to the total decay time
uncertainty is negligible. Poorly reconstructed events with calculated
decay time uncertainties greater than $2\,\tau_{D^0}$ are discarded.
Only events with decay times between — 12rpo and Idr^o 
are retained. 
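A minimal sketch of a transverse proper-time calculation follows. It assumes the standard relation $t = m L_{xy}/(p_T c)$, with $L_{xy}$ the flight vector from the IP to the $K$-$e$ vertex projected onto the transverse momentum direction; the text does not spell out the exact formula used, so this relation, the function names, and the approximate $D^0$ mass and lifetime constants are all assumptions made for illustration.

```python
import math

C_CM_PER_S = 2.99792458e10   # speed of light in cm/s
D0_MASS_GEV = 1.8648         # assumed approximate D0 mass, GeV/c^2
D0_LIFETIME_S = 410.1e-15    # assumed approximate D0 mean lifetime, s

def transverse_proper_time(ip_xy, vertex_xy, pt_xy, mass=D0_MASS_GEV):
    """Proper decay time in seconds from transverse quantities only.

    ip_xy, vertex_xy: (x, y) positions in cm; pt_xy: (px, py) in GeV/c.
    The flight vector is projected onto the transverse momentum direction,
    so a vertex reconstructed behind the IP gives a negative decay time.
    """
    dx = vertex_xy[0] - ip_xy[0]
    dy = vertex_xy[1] - ip_xy[1]
    pt = math.hypot(pt_xy[0], pt_xy[1])
    if pt == 0.0:
        raise ValueError("transverse momentum must be non-zero")
    l_xy = (dx * pt_xy[0] + dy * pt_xy[1]) / pt   # signed flight length, cm
    return mass * l_xy / (pt * C_CM_PER_S)        # t = m * L_xy / (p_T * c)

def in_lifetime_units(t_seconds, tau=D0_LIFETIME_S):
    """Express a decay time in units of the assumed mean D0 lifetime."""
    return t_seconds / tau
```

Because resolution effects can place the reconstructed vertex on either side of the IP, a signed projection of this kind naturally produces the negative decay times that the selection window allows.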

In addition to the above criteria, events are selected 
using a neural network trained to distinguish prompt 
charm signal from background events. The event selec- 
tor NN uses a five-element input vector: p^, |T|, 
$\theta^*(p_{K\text{-}e}, T)$, and $\theta^*(p_K, p_e)$. It has a single hidden layer
of nine nodes and is also constructed using JetNet 3.4. A 
cut is made on the event selector NN output such that the 
statistical sensitivity to mixed charm events is optimized 
for the 87-fb$^{-1}$ dataset used.

Events with multiple candidates meeting all of the 
above selection criteria but which share tracks are dis- 
carded, leading to a loss of ~10% of signal events passing 
all other cuts. There are no differences in RS and WS 
reconstruction efficiencies in the final event selection. 

Backgrounds predominantly come from prompt charm events, with minor
contributions from $uds$ and $b\bar{b}$ events. The backgrounds from
misidentified charged particle species are negligible. Nearly all
background events come from $D^0$ and $D^+$ semileptonic decays to final
states including both a charged $K$ and an $e$ that are combined with a
random $\pi^+$, and truly random combinatorics in which the $K$ and $e$ do
not share a common charm parent. The decay time distributions of
background charm parents differ somewhat from those of well-reconstructed
charm parents because the $p^*(D^0)$ estimator returns a correct momentum
estimate only for signal events. The decay time probability density
functions (PDF's) for these charm backgrounds are thus not fit from the
data but, instead, are derived from simulated events. The decay time
distribution for truly random (i.e., zero-lifetime) events is fit in the
data using a sum of three Gaussians.

The $\Delta M$ backgrounds qualitatively fall into two categories: those
which peak under the signal $\Delta M$ distribution and those which do
not. Both RS and WS events have small peaking $\Delta M$ backgrounds. In
the WS sample, the peaking component arises from semi-electronic $D^+$
decays with a $K^*$ daughter; these events are fit using the $D^+$ decay
time PDF and are thus differentiated from mixed signal events. In the RS
sample, the fraction of background events sharing the RS unmixed signal
PDF is estimated from fits to simulated datasets and a correction of ~4%
is made to the fit number of RS signal decays to obtain the actual number.
The RS $D^+$ $\Delta M$ background peaks well away from the $\Delta M$
signal region and its shape is fixed from simulated events. The shape is
floated as a systematic check and there is a negligible effect on the
final value of $R_{\rm mix}$.

Non-peaking combinatoric $\Delta M$ backgrounds are modeled from the data
by combining $K$-$e$ vertex and $\pi$ candidates from different events
("event mixing"). Events are selected from the off-resonance data such
that a $K$-$e$ vertex drawn from one event may be combined with a $\pi$
candidate from any other event. Approximately 70,000
off-resonance data events are used to generate 0(10^) RS 
and WS pseudo-events passing the final event selection 
criteria. A Kolmogorov-Smirnov test [10] shows that the event-mixed RS and
WS $\Delta M$ distributions have a ~92% probability of sharing the same
underlying parent distribution. The shape of the PDF associated with the
non-peaking combinatoric $\Delta M$ background is taken from the combined
event-mixed RS and WS $\Delta M$ distributions and denoted by
$H(\Delta M)$.
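The event-mixing and comparison steps can be sketched as follows. The dictionary keys `ke_vertex` and `pion` are hypothetical placeholders for reconstructed objects, and the p-value uses the standard asymptotic Kolmogorov series; the text does not say which implementation of the test was used, so this is an illustration under those assumptions rather than the analysis code.

```python
import itertools
import math

def event_mixed_pairs(events):
    """Pair the K-e vertex of each event with the pion of every other event."""
    return [
        (first["ke_vertex"], second["pion"])
        for first, second in itertools.permutations(events, 2)
    ]

def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic: max |ECDF_a - ECDF_b|."""
    a, b = sorted(sample_a), sorted(sample_b)
    i = j = 0
    d = 0.0
    while i < len(a) and j < len(b):
        x = min(a[i], b[j])
        while i < len(a) and a[i] <= x:   # step both ECDFs past x
            i += 1
        while j < len(b) and b[j] <= x:
            j += 1
        d = max(d, abs(i / len(a) - j / len(b)))
    return d

def ks_pvalue(d, n_a, n_b, terms=100):
    """Asymptotic probability that both samples share one parent distribution."""
    n_eff = n_a * n_b / (n_a + n_b)
    lam = (math.sqrt(n_eff) + 0.12 + 0.11 / math.sqrt(n_eff)) * d
    if lam < 0.3:                         # series converges slowly; p is ~1 here
        return 1.0
    series = sum(
        (-1) ** (k - 1) * math.exp(-2.0 * k * k * lam * lam)
        for k in range(1, terms + 1)
    )
    return min(1.0, max(0.0, 2.0 * series))
```

In use, the $\Delta M$ values of the RS and WS pseudo-events would be passed to `ks_statistic`, and a large p-value would support merging the two samples into a single background shape.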

An initial fit to the RS data is used to extract the unmixed signal mean
lifetime, shape of the signal $\Delta M$ distribution, and number of RS
signal events. The mixed WS signal PDF parameters are taken from this
high-statistics fit. The full extended likelihood function is:

$$\mathcal{L} = \frac{e^{-\lambda}\lambda^{N}}{N!}\prod_{i=1}^{N}\sum_{j} f_j\,\mathcal{P}_j, \qquad (3)$$

where $f_j$ ($n_j$) is the fraction (number) of events assigned to event
class $j$, $\lambda = \sum_j n_j$ is the fit total number of candidates,
$\mathcal{P}_j$ is the value of the PDF for event class $j$, and the
extended product runs over all $N$ events. The PDF's for RS and WS signal
and background classes are detailed below. The shapes of all PDF's are fit
in the data unless otherwise noted. All fits are done as unbinned maximum
likelihood fits.
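To make the structure of the extended likelihood concrete, the sketch below evaluates its negative logarithm for given class yields and per-event PDF values. It assumes that each class fraction is the class yield divided by the total fitted yield; the function name and the input layout are hypothetical.

```python
import math

def extended_nll(yields, pdf_values):
    """Negative log of the extended likelihood in Eq. (3).

    yields: fitted number of events n_j for each event class.
    pdf_values: one row per event, holding the PDF value of every class
                evaluated at that event's (Delta M, decay time).
    """
    total = sum(yields)                        # fitted total number of candidates
    n_events = len(pdf_values)
    fractions = [n / total for n in yields]    # assumed: f_j = n_j / total
    # Poisson term: log(exp(-total) * total^N / N!)
    log_l = -total + n_events * math.log(total) - math.lgamma(n_events + 1)
    for row in pdf_values:
        density = sum(f * p for f, p in zip(fractions, row))
        log_l += math.log(density)
    return -log_l
```

A minimizer would vary the yields and the shape parameters hidden inside `pdf_values`; the Poisson term is what ties the sum of the fitted yields to the observed number of events.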

Four classes of events are fit in the RS data: unmixed signal events
($n_{\rm unmix}$), background $D^0$ decays ($n_{{\rm bkgd}D^0}$), RS $D^+$
decays ($n_{{\rm RS}D^+}$) and completely random three-track combinations
($n_{\rm zero\text{-}life}$). The RS signal $\Delta M$ shape is fit to a
threshold function with a power-law turn-on and an exponentially decaying
tail, while the decay time distribution is fit as an exponential decay
with floating decay constant convolved with a resolution model ($G$) taken
from simulated signal events. The RS signal $\Delta M$ and decay time
combined PDF is:

$$\mathcal{P}_{\rm unmix} = u^{\gamma}\exp\Big(\sum_{i=1}^{3} a_i u^i\Big)\cdot\big[e^{-t/\tau_{D^0}}\,\theta(ct > 0)\otimes G\big], \qquad (4)$$

where $u = \Delta M - M_{\pi^+}$, and $M_{\pi^+}$ is fixed to the nominal
PDG value. The four parameters of the signal $\Delta M$ PDF ($\gamma$,
$a_i$), $\tau_{D^0}$ and the unmixed signal population are allowed to
float in the RS fit.

The background $D^0$ event class uses the combined PDF:

$$\mathcal{P}_{{\rm bkgd}D^0} = H(\Delta M)\cdot T_{D^0}(t), \qquad (5)$$

where $T_{D^0}(t)$ represents the shape of the decay time distribution for
this event class derived from simulated events. The RS $D^+$ event class
has the combined PDF:



RSD + 



where ( and are fixed from fits to simulated events, 
and $T_{D^+}(t)$ represents the shape of the decay time distribution for
this event class derived from simulated events. Finally, zero-lifetime
events use the combined PDF:

$$\mathcal{P}_{\rm zero\text{-}life} = H(\Delta M)\cdot\sum_{i=1}^{3} f_i \exp\left[-\frac{1}{2}\left(\frac{\mu - ct}{\sigma_i}\right)^2\right], \qquad (7)$$

with the constraint $f_3 = 1 - f_1 - f_2$. Fifteen parameters are floated
in the RS fit: the signal $\Delta M$ shape parameters and mean $D^0$
lifetime; the mean, widths and relative fractions of the zero-lifetime
triple Gaussian; and the population of the four RS event classes.

Figure 1 shows the projections of the RS fit model onto the RS data. After
correcting for peaking RS background events as described above, we find
$n_{\rm unmix} = 49620 \pm 265$ events. The fit value for the unmixed
$D^0$ mean lifetime is, within its statistical error of ~0.6%, consistent
with the current PDG value.

Five classes of events are fit in the WS data: mixed signal events
($n_{\rm mix}$), background $D^0$ decays ($n_{{\rm bkgd}D^0}$), peaking WS
$D^+$ decays ($n_{{\rm pkng}D^+}$), non-peaking WS $D^+$ decays
($n_{{\rm non\text{-}pkng}D^+}$) and completely random three-track
combinations ($n_{\rm zero\text{-}life}$). WS signal events share the
sharply peaked $\Delta M$ distribution of the RS signal, but have the
modified lifetime distribution given by Eq. (1). The full form of the WS
mixed signal PDF is:

$$\mathcal{P}_{\rm mix} = u^{\gamma}\exp\Big(\sum_{i=1}^{3} a_i u^i\Big)\cdot\Big[\Big(\frac{t}{\tau_{D^0}}\Big)^2 e^{-t/\tau_{D^0}}\,\theta(ct > 0)\otimes G\Big], \qquad (8)$$

where $a_i$, $\gamma$ and $\tau_{D^0}$ are fixed to the values found in
the RS unmixed fit. The background $D^0$ and random combinatoric event
classes use the combined likelihoods shown in Eqs. (5) and (7),
respectively. The peaking WS $D^+$ event class uses the fixed signal
$\Delta M$ PDF and the $D^+$ lifetime model:

$$\mathcal{P}_{{\rm pkng}D^+} = u^{\gamma}\exp(a_1 u + a_2 u^2 + a_3 u^3)\cdot T_{D^+}(t). \qquad (9)$$

FIG. 1: $\Delta M$ (left) and decay time (right) projections of fit
(solid lines) to RS data (points): (top left) $\Delta M$ signal region,
with unmixed signal (above dashed line) and background (dashed line);
(bottom left) magnified vertical scale $\Delta M$ full fit region, with
unmixed signal (white), $D^0$ background (light grey), $D^+$ background
(dark grey), zero-lifetime background (black); (top right) decay time
signal region, with signal and background components as in bottom left
plot; (bottom right) magnified vertical scale decay time full fit region,
with signal and background components as in bottom left plot.



Non-peaking WS $D^+$ events use the likelihood:

$$\mathcal{P}_{{\rm non\text{-}pkng}D^+} = H(\Delta M)\cdot T_{D^+}(t). \qquad (10)$$



Eleven parameters are floated in the WS fit: the mean, widths and relative
fractions of the zero-lifetime triple Gaussian and the populations of the
five WS event classes. Figure 2 shows the projections of the WS fit model
onto the WS data. We find $n_{\rm mix} = 114 \pm 61$ events, leading to a
value of $R_{\rm mix} = 0.0023 \pm 0.0012$ (stat). Fits to toy Monte Carlo
data sets show that, for an assumed zero mixing rate, a fit number of
mixed events greater than the result here is likely to occur in about 5%
of experiments. No significant asymmetries are seen when the WS dataset is
divided and fit based on the production flavor of the
$D^0$/$\bar{D}^0$.
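The quoted central value and statistical error follow directly from the fitted yields. The short sketch below repeats that arithmetic, assuming simple uncorrelated error propagation for a ratio; the helper name is hypothetical.

```python
import math

def ratio_with_error(numerator, numerator_err, denominator, denominator_err=0.0):
    """Ratio of two yields with uncorrelated relative errors added in quadrature."""
    ratio = numerator / denominator
    relative = math.hypot(numerator_err / numerator, denominator_err / denominator)
    return ratio, ratio * relative

# Yields quoted in the text: n_mix = 114 +/- 61 and n_unmix = 49620 +/- 265.
r_mix, r_mix_err = ratio_with_error(114.0, 61.0, 49620.0, 265.0)
```

The relative error on the RS yield is roughly one hundredth of the relative error on the WS yield, so including it leaves the rounded statistical error unchanged.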

Goodness-of-fit is checked by comparing the minimized negative log
likelihood (NLL) values of the RS and WS data fits with NLL distributions
generated from toy MC; the data-fit NLL values lie well within the range
predicted by the toy fits. The WS fit model is also tested for bias and
correct error scaling with fits to simulated datasets containing 0, 50 and
100 mixed WS events ($R_{\rm mix} \approx 0$, 0.001, 0.002, respectively),
and no evidence of bias or improperly scaled errors is seen.

FIG. 2: $\Delta M$ (left) and decay time (right) projections of fit
(solid lines) to WS data (points): (top left) $\Delta M$ signal region,
with mixed signal (above dashed line) and background (dashed line);
(bottom left) $\Delta M$ full fit region, with $D^0$ background (white),
zero-lifetime background (dark grey), non-peaking $D^+$ background
(intermediate grey), peaking $D^+$ background (light grey), mixed signal
(black); (top right) decay time signal region, with signal and background
components as in bottom left plot; (bottom right) magnified vertical scale
decay time full fit region, with signal and background components as in
bottom left plot.

The systematic error includes variations of the WS combined mixed signal
PDF, random combinatoric $\Delta M$ PDF shape, signal decay time
resolution model, and background $D^0$ and $D^+$ decay time PDF's. By far,
the dominant systematic is the statistical precision with which the RS
$\Delta M$ PDF is known. This systematic is characterized by sequentially
varying by $\pm 1\sigma$ one of the signal $\Delta M$ shape parameters
floated in the RS fit, along with the mean $D^0$ lifetime, and refitting
the WS data. The resulting deviations from the central value of
$n_{\rm mix}$ obtained in the nominal fit to the WS data are summed in
quadrature, taking into account correlations among the fit parameters.
Table I shows the error on $R_{\rm mix}$ attributable to each source of
systematic error listed above. There are no significant effects on
$R_{\rm mix}$ attributable to the choice of vertexing algorithm, IP, $K$
or $e$ particle identification, NN event selector cut, decay time error
cut, signal resolution model or $\Delta M$ sideband cut. Taking the total
systematic error as the sum in quadrature of the values in Table I gives
$\sigma_{\rm syst} = 0.0004 = 0.34\,\sigma_{\rm stat}$.
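The quadrature sum can be checked with a few lines of arithmetic. The sketch uses the five $|\Delta R_{\rm mix}|$ values listed in Table I and takes the statistical error as 61/49620 from the fitted yields; the variable names are illustrative.

```python
import math

# The five systematic shifts |Delta R_mix| listed in Table I.
table_values = [0.00032, 0.00008, 0.00015, 0.00016, 0.00012]

# Sum in quadrature of the individual contributions.
syst_error = math.sqrt(sum(value ** 2 for value in table_values))

# Statistical error on R_mix from the fitted yields quoted in the text.
stat_error = 61.0 / 49620.0

syst_over_stat = syst_error / stat_error
total_over_stat = math.sqrt(1.0 + syst_over_stat ** 2)
```

The largest single entry contributes more than half of the summed variance, which is why the systematic error is dominated by one source.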

TABLE I: Systematic variations in the value of $R_{\rm mix}$.

Varied parameter                        $|\Delta R_{\rm mix}|$
Mixed $\Delta M$ PDF                    0.00032
Mixed decay time PDF                    0.00008
Random combinatoric $\Delta M$ shape    0.00015
Bkgd $D^0$ decay model                  0.00016
Bkgd $D^+$ decay model                  0.00012

A scan of the change in NLL for $n_{\rm mix}$ values in the region
surrounding the fit minimum is used to calculate upper limits. By
construction, the NLL scan includes only the statistical error of the fit;
the systematic error is included as a small perturbation on the values of
$\Delta$NLL used to establish confidence intervals. The total error is
taken as the sum in quadrature of the statistical and systematic errors,
$\sigma_{\rm total} = \sqrt{1 + 0.34^2}\,\sigma_{\rm stat} = 1.06\,\sigma_{\rm stat}$.
The 95% CL UL is taken as the value of $n_{\rm mix}$ where the NLL value
changes from its minimum by
$\Delta{\rm NLL} = (0.5)(1.06)(1.96^2)(0.97) = 1.97$, where a one-sigma
change is $\Delta{\rm NLL} = 0.5$, which yields $R_{\rm mix} < 0.0046$
(95% CL). The factor of "0.97" in the preceding expression for the UL
arises from the fraction of the a posteriori distribution of $n_{\rm mix}$
lying in the physical region. A similar calculation shows
$R_{\rm mix} < 0.0042$ at the 90% CL. The relatively small error (~0.5%)
on $n_{\rm RS}$ is negligible and has been ignored.
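The threshold arithmetic and the idea of reading an upper limit off a scan can be sketched as below. The threshold function reproduces the product exactly as written in the text; the scan helper, its linear interpolation and the list-of-points input format are assumptions for illustration, and the real scan comes from the fit rather than from a formula.

```python
def delta_nll_threshold(z_value, error_scale, physical_fraction):
    """Delta-NLL cut written as 0.5 * error_scale * z^2 * physical_fraction."""
    return 0.5 * error_scale * z_value ** 2 * physical_fraction

def upper_limit_from_scan(scan_points, threshold):
    """First upward crossing of `threshold` in a list of (n_mix, delta_nll).

    Points must be ordered by increasing n_mix and start at or above the
    fit minimum; the crossing is found by linear interpolation.
    """
    for (n_lo, d_lo), (n_hi, d_hi) in zip(scan_points, scan_points[1:]):
        if d_lo < threshold <= d_hi:
            return n_lo + (threshold - d_lo) / (d_hi - d_lo) * (n_hi - n_lo)
    raise ValueError("scan does not reach the threshold")

# Factors quoted in the text for the 95% CL limit.
threshold_95 = delta_nll_threshold(1.96, 1.06, 0.97)
```

Dividing the crossing point in $n_{\rm mix}$ by the RS yield then converts the result into a limit on $R_{\rm mix}$.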

In summary, we have obtained a value of $R_{\rm mix}$ consistent with no
mixing and set a limit of $R_{\rm mix} < 0.0042$ (0.0046) at the 90% (95%)
CL. This is slightly lower than the previous best published limit using
semileptonic decays of $R_{\rm mix} < 0.005$ (90% CL) set by E791 [11].
However, the positive result obtained for $n_{\rm mix}$ obscures the
actual sensitivity of the analysis. The current best published UL on
$R_{\rm mix}$ ($< 0.0016$, 95% CL) comes from the BABAR hadronic $K\pi$
mixing analysis [13]. Had the central value of $n_{\rm mix}$ been observed
here to be zero, the 90% (95%) CL UL for the mixing rate would have been
0.0016 (0.0020), comparable to both the hadronic BABAR result and a recent
semileptonic FOCUS analysis [13].

We are grateful for the excellent luminosity and ma- 
chine conditions provided by our PEP-II colleagues, and 